Biasing Monte-Carlo Simulations through RAVE Values

نویسندگان

  • Arpad Rimmel
  • Fabien Teytaud
  • Olivier Teytaud
چکیده

The Monte-Carlo Tree Search algorithm has been successfully applied in various domains. However, its performance heavily depends on the Monte-Carlo part. In this paper, we propose a generic way of improving the Monte-Carlo simulations by using RAVE values, which already strongly improved the tree part of the algorithm. We prove the generality and efficiency of our approach by showing improvements on two different applications: the game of Havannah and the game of Go.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Energy Study at Different Temperatures for Active Site of Azurin in Water, Ethanol, Methanol and Gas Phase by Monte Carlo Simulations

The interaction between the solute and the solsent molecules play a crucial role in understanding the various molecular processes involved in chemistry and biochemistry, so in this work the potential energy of active site of azurin have been calculated in solvent by the Monte Carlo simulation. In this paper we present quantitative results of Monte Carlo calculations of potential energies of ...

متن کامل

Combining expert, offline, transient and online knowledge in Monte-Carlo exploration

We combine for Monte-Carlo exploration machine learning at four different time scales: – online regret, through the use of bandit algorithms and Monte-Carlo estimates; – transient learning, through the use of rapid action value estimates (RAVE) which are learnt online and used for accelerating the exploration and are thereafter neglected; – offline learning, by data mining of datasets of games;...

متن کامل

Gyration Radius and Energy Study at Different Temperatures for Acetylcholine Receptor Protein in Gas Phase by Monte Carlo, Molecular and Langevin Dynamics Simulations

The determination of gyration radius is a strong research for configuration of a Macromolecule. Italso reflects molecular compactness shape. In this work, to characterize the behavior of theprotein, we observe quantities such as the radius of gyration and the average energy. We studiedthe changes of these factors as a function of temperature for Acetylcholine receptor protein in gasphase with n...

متن کامل

Probabilistic Multi Objective Optimal Reactive Power Dispatch Considering Load Uncertainties Using Monte Carlo Simulations

Optimal Reactive Power Dispatch (ORPD) is a multi-variable problem with nonlinear constraints and continuous/discrete decision variables. Due to the stochastic behavior of loads, the ORPD requires a probabilistic mathematical model. In this paper, Monte Carlo Simulation (MCS) is used for modeling of load uncertainties in the ORPD problem. The problem is formulated as a nonlinear constrained mul...

متن کامل

Comparative Study of Various Self-Consistent Event Biasing Schemes for Monte Carlo Simulations of Nanoscale MOSFETs

Semiclassical Boltzmann transport has been the principal theory in the field of modeling and simulation of semiconductor technology since its early development. To date, most commer‐ cial device simulations including the full-band Monte Carlo (FBMC) method are based on the solution of the Boltzmann transport equation (BTE) and its simplified derivatives such as the hydrodynamic (HD) equations a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010